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Abstract 

Modifications of quantum mechanics are considered, in which the state vec- 
tor of any system, large or small, undergoes a stochastic evolution. The 
general class of theories is described, in which the probability distribution 
of the state vector collapses to a sum of delta functions, one for each possible 
final state, with coefficients given by the Born rule. 
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I. INTRODUCTION 



There is now in my opinion no entirely satisfactory interpretation of 
quantum mechanics [1]. The Copenhagen interpretation [2] assumes a myste- 
rious division between the microscopic world governed by quantum mechan- 
ics and a macroscopic world of apparatus and observers that obeys classical 
physics. During measurement the state vector of the microscopic system 
collapses in a probabilistic way to one of a number of classical states, in a 
way that is unexplained, and cannot be described by the time-dependent 
Schrodinger equation. The many- worlds interpretation [3] assumes that the 
state vector of the whole of any isolated system does not collapse, but evolves 
deterministically according to the time-dependent Schrodinger equation. In 
such a deterministic theory it is hard to see how probabilities can arise. Also, 
the branching of the world into vast numbers of histories is disturbing, to say 
the least. The decoherent histories approach [4] like the Copenhagen inter- 
pretation gives up on the idea that it is possible to completely characterize 
the state of an isolated system at any time by a vector in Hilbert space, 
or by anything else, and instead provides only a set of rules for calculating 
the probabilities of certain kinds of history. This avoids inconsistencies, but 
without any objective characterization of the state of a system, one wonders 
where the rules come from. 

Faced with these perplexities, one is led to consider the possibility that 
quantum mechanics needs correction. There may be a Hilbert space vector 
that completely characterizes the state of a system, but that suffers an in- 
herently probabilistic physical collapse, not limited as in the Copenhagen 
interpretation to measurement by a macroscopic apparatus, but occurring 
at all scales, though presumably much faster for large systems. From time 
to time specific models for this sort of collapse have been proposed[5]. In the 
present article we will consider the properties of theories of the stochastic 
evolution of the state vector in a more general formalism. We assume that 
this evolution depends only on the state vector, with no hidden variables. In 
contrast to earlier work, we concentrate on the linear first-order differential 
equation that in general describes the evolution of the probability distribu- 
tion of the state vector in Hilbert space. We find conditions on this evolution 
so that it leads to final states with probabilities given by the Born rule of 
ordinary quantum mechanics. This general formalism is also applied to the 
special case of a state vector that evolves through quantum jumps. Theories 
of the evolution of the density matrix are examined as an important special 
case of the more general formalism. 
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II. EVOLUTION OF THE STATE VECTOR'S PROBABILITY 

DENSITY 

We consider a general isolated system, which may or may not include 
a macroscopic measuring apparatus and/or an observer. We assume as in 
ordinary quantum mechanics that the state of the system is entirely de- 
scribed by a vector in Hilbert space. The state vector here is taken in a 
sort of Heisenberg picture, in which operators A(t) have a time dependence 
dictated by the Hamiltonian H as ex.p(iHt)A(0) exp(— iHt). But the state 
vector in this sort of theory is not time-independent; it undergoes a stochas- 
tic evolution, slow for microscopic systems but rapid for larger systems, so 
that at any time t there is a probability P(ip, t)dip for the wave function 
to be in a small volume dip around any value ip. Here we are adopting a 
basis that is so far arbitrary, labeled by a discrete index i, so that ip is an 
abbreviation for the whole set of components ipi and ip* , constrained by the 
normalization condition J2i l^i| 2 = 1j an d dtp is defined as 



a measure invariant under unitary transformations of the ipi. The continuum 
case will be considered later, in Sec. IV. 

We assume time-translation invariance, so that if the wave function at 
time t has a definite value ip, then at a later time t' the probability density 
at ip' will be some function H(ip',ip,t' — t) of ip', of ip, and of the elapsed 
time t' — t, but not separately of t or t'. It follows then from the rules 
of probability that if at time t the wave function has a probability density 
P(ip,t), then at time t' the probability density will be 



Differentiating with respect to t' and then setting t' = t gives our funda- 
mental differential equation for the evolution of the probability density: 




(1) 







where K is the kernel 



K^'^= —U(ip',ip,T) 



(4) 



Jr=0 
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which depends on the details of the system under study, including any mea- 
suring apparatus that the system may contain. Eq. (3) resembles the time- 
dependent Schrodinger equation with K in place of —iH, because both follow 
from time-translation invariance, but Eq. (3) describes the evolution of the 
probability density in Hilbert space rather than of the state vector, and so 
K is real rather than anti-Hermitian. Like the time-dependent Schrodinger 
equation, Eq. (3) neither violates nor guarantees Lorentz invariance. Pre- 
sumably, in a Lorentz invariant theory, K would be accompanied with other 
kernels that describe how probabilities change with the position of the ob- 
server. 

The solution of Eq. (3) is of course 

P(iP',t) = y#(e xt )^P(^0), (5) 

with the exponential of Kt defined as usual by its power series expansion. 
To evaluate this exponential, we let /jv(V') be the linearly independent right- 
eigenfunctions of K: 

J dip K^rf f N (ip) = -Ajv/jvO') , (6) 

with eigenvalues —Xn- Because there is no need for K to be Hermitian, 
some of the eigenfunctions and eigenvalues may be complex, but because K 
is real, any complex eigenfunctions and eigenvalues must come in complex 
conjugate pairs. 

We will assume that the /jv(V') form a complete set. This is the generic 
case; other cases can be handled by letting some eigenvalues and eigenfunc- 
tions of K merge with each other. Where the Jn form a complete set we 
may write the kernel as 

K ^',i> = ~ Xn /aKVO 9n{iP) , (7) 

N 

where <7jv(V0 are some coefficient functions, not related in any simple way 
to /jv(^). The eigenvalue condition (6) requires that 

J dip gM(ip) /jvW = $nm ■ (8) 

Then will be a left-eigenfunction of K, also with eigenvalue —Xn- 

J dip' gN{ip') K^rf = -X N gN{ip) ■ (9) 
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(Eq. (7) does not define gjy in the case Xjy = 0; in this case the definition 
is provided by Eqs. (8) and (9).) The completeness relation for the /at can 
then be expressed as 

V^EMVOsivW (io) 

N 

where ly,'^ is defined so that, for any smooth function F(i/j), 

J #V^F(V0 = Fty')- (11) 

It is elementary then to use the power series expansion for the exponential 
to calculate that 

= J2 e ~ XNt M^')9NW. (12) 

N 

The probability distribution for the wave function is therefore 

p(y, t) = J2 e-^'MYO / qnW) p&', o) • (13) 

N J 

(Where the /at miss being a complete set by a finite number of terms, the 
exponentials are in general accompanied with polynomial functions of time.) 

III. LIMIT OF EVOLUTION 




It is clear that in order for the probability distribution to approach any 
sort of limit for t — > oo, all the eigenvalues must have negative real parts; 
that is, ReAjv > 0. If we assume that there is a minimum value to the 
smallest non-zero value of RcAat, then the probability distribution becomes 
dominated by the zero modes: for t — > oo 

P(V, O^E /» WO / ffn(^) p (^ ; o) , (1 4 ) 

n J 

where n runs over the values of N for which A^r = 0. (The contribution of 
eigenmodes with ReA^ = but IulAat ^ presumably oscillates so rapidly 
as t — > oo as to be unobservable.) The f n (if>) can be regarded as fixed points 
of the differential equation (3). The magnitude of the non-zero eigenvalues 
depends on the nature of the system in question. Presumably where a system 
is large, as in measurement by a macroscopic apparatus, the values of the 
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non-zero eigenvalues are large, in which case the approach to the limit (14) 
is exponentially fast. 

Although the limit of the probability distribution for t — > oo depends 
only on the zero-modes f n and g n , in general to calculate the evolution of 
the probability distribution for finite times we need to know all the eigen- 
functions /at and qn- But the whole time dependence of the probability 
distribution can be calculated in terms of the zero modes in the special case 
in which all non-zero \n are equal, say to A. Then Eq. (12) gives 



e Kt 



where v runs over the values of N for which Ajv 7^ 0. The completeness 
relation (10) gives 

/"WO 9u(ip) = V,V> - S /"WO 5n W) 



so 

'e Kt 



and the probability distribution is 

P(V, t) = Pty>, 0)e- At + [l - e~ At ] £ /n W) / #' 0n WO PW', 0) , (15) 

in which we can see explicitly how the probability distribution approaches 
the limit (14) for t — > 00. 

The kernel K (including the zero modes f n and g n along with the non- 
zero eigenvalues — \ v ) depends on the details of the system in question, as 
well as depending on the as yet mysterious dynamics of the collapse process. 
Consider a system containing a subsystem with a complete set of commuting 
observables whose eigenvalues are labeled by an index n, and a measuring 
apparatus that through a unitary evolution of the whole system becomes 
entangled with the subsystem in such a way that, when the subsystem is in 
the n th eigenstate of the observables, the apparatus is in a unique state. It 
is convenient to perform a unitary transformation to a new basis, in which 
(p n is the component of the state vector along such a joint state of the whole 
system. In order to reproduce the results of the Copenhagen interpretation 
the probability distribution at late times must relax to a sum over n of terms 
proportional to Y\ m ^ n 5(\tp m \ 2 ), so that only ip n is allowed to be non-zero in 
the nth term. To reproduce the Born rule, the coefficient of the nth term 
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must be proportional to the initial value of \<£> n \ 2 - Comparing with Eq. (14), 
we see that the zero modes here can be labeled with the same index n, with 

f n (ip) = F n (kxg(p n ) Yl 5(\ip m \ 2 ) , g n (ip) = \ip n \ 2 , (16) 

where T n {0) is an unknown function satisfying Jq 77 J- n (6) dd = 2tt. The nor- 
malization of these zero modes has been chosen to be consistent with Eq. (8), 
which requires that / dipf n (ip)g m (ip) = S nm . Also, since it is only f n g n that 
enter in this requirement, we have made an arbitrary choice of a conve- 
nient normalization for f n , thus fixing the normalization of g n . According 
to Eq. (14), the probability density at late times becomes 

P(<p,t) ^£j;(Arg^) ( J] 5(\y m \ 2 ) ) f dy/l^nl 2 W,0) . (17) 

Note that here Eqs. (9) and (16) give, for each n and 92 

J VK| 2 i^ = o. (is) 

This implies the time-independence of the quantity 

P n = J dp \ Vn \ 2 P(v,t) . (19) 

This makes sense, because P n according to the Born rule is the probability 
that, when the collapse is finished, the state of the system will be found 
in the basis state n, and this of course must be independent of t. Since 
Y^ n l^nl 2 = 1) the sum of Eq. (18) over n yields 

J dip' K^ v = , (20) 

which is the condition that Eq. (3) respects the conservation of the total 
probability / dipP(ip,t). 

In usual measurements, the measuring apparatus does not evolve into a 
unique state when the subsystem is in the nth eigenstate of a set of observ- 
ables, but into any one of a number of apparatus states, labeled with another 
index r. It is convenient again to choose a corresponding basis, so that the 
components of the wave function are labeled tp nr , with J2nr l^nr] 2 = 1- In 
this case, assuming all apparatus states r for a given subsystem state n are 



7 



equally probable, consistency with the results of the Copenhagen interpre- 
tation and the condition / dip f n g m = 5 nm requires that 

fn(<p)=J r n ]J ^(I'Anrl 2 ) , 5n(<£>) = ^ |<£>rar | 2 , (21) 

where T n is an unknown function of the phases of all (p nr , whose average over 
phases is unity, and the individual normalization of f n and g n has again been 
chosen for convenience. The individual probabilities / g n (ip)P(ip)dip and the 
total probability J P(ip)d(p are conserved here for the same reason as before. 

From the point of view adopted here, there is nothing special about 
measurement. Measurement is just a process in which the state vector of a 
system (typically microscopic) becomes entangled with the state vector of 
a relatively large system, which then undergoes a collapse to an eigenstate 
of some operators determined by the characteristics of that system. So we 
expect that the state vector of any system undergoes a similar collapse, but 
one that is much faster for large systems. But collapse to what? Without 
attempting a precise general prescription, we have in mind that these are the 
sorts of states familiar in classical physics. For instance, in a Stern-Gerlach 
experiment, they would be states in which a macroscopic detector registers 
that an atom has a definite trajectory, not a superposition of trajectories. 
In Schrodinger's macabre thought experiment [6] , they are states in which 
the cat is alive, or dead, but not a superposition of alive and dead. These 
states are like the "pointer states" of Zurek[7], but here these basis states 
are determined by the physics of the assumed collapse of the state vector, 
rather than by the decoherence produced by interaction with small external 
perturbations. 

IV. CONTINUUM STATES 

It is straightforward to adapt this formalism to the continuum case, 
where the wave functions depend on a continuous variable x rather than 
a discrete label i. In the continuum case, we take ip as an abbreviation 
for the functions ip(x) and tp*(x), normalized so that J dx\ip(x)\ 2 = 1; the 
probability distribution P[ip,i\ and the kernel K^^i are functionals of these 
functions; and J dip is a functional integral, with a normalization that can 
be chosen as convenience dictates. There is no reason here to expect a 
gap between the zero and non-zero eigenvalues of K, and in the example 
discussed in Sec. VII there is no such gap, so we will not here bother 
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to separate the zero-modes from the eigenfunctionals of K with non-zero 
eigenvalue. The kernel can be expressed as 



= -JdNX N f N W\g N [^] , (22) 

where 

J dip K^fNty] = -Ajv/jvW) , / # 9n>WnW\ = S(N' - N) . (23) 
Using the completeness relation 

1*,* = J dN f N W\g N [il>];, (24) 

we have 

'e Kt ] „ = I dN f N W\g N W- XNt (25) 
and the probability distribution at time t is 

Pty,t] = JdN f N [^]e- XNt j dip 1 g N mP[iP',0] . (26) 

As before, to avoid runaway solutions we need to assume that ReAAr > for 
all eigenvalues, in which case with increasing time Eq. (26) is increasingly 
dominated by the eigenmodes with smallest Aat. But without a gap between 
zero and non-zero eigenvalues, the probability distribution may not approach 
any specific limit exponentially as t — > oo. 

V. QUANTUM JUMPS 

Our discussion so far has been very general, not dependent on any specific 
picture of the evolution of the state vector. We can be a little more concrete, 
by assuming that the wave function undergoes a series of quantum jumps, 
from ip to Jip, where J is a non- linear operator depending on one or more 
random parameters. If the rate of jumps is T, and the wave function at some 
time t is if)', then at a slightly later time t + dt the probability distribution at 
ip is (1 — Ydt)\^ +Tdt (l^i ji/,'), where brackets indicate an average over the 
random parameters on which the operator J depends. Hence the evolution 
of the probability distribution is given by Eq. (3), with kernel 

K^> = -T (l^tf - • (27) 
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We note in particular that, for any function (or functional) g{ip) of the wave 
function, we have 

J d*p <7(V) Km = -r(<7(v0 - < 5 ( JVO) ) , (28) 

so the condition for g(ip) to be a left eigenfunction of the kernel is that 

G/W))=A</(V0, (29) 

in which case the corresponding eigenvalue is 

A = -r(l-A). (30) 

The left-eigenfunctions of the kernel with zero eigenvalue are those functions 
g{ip) that on average are unaffected by quantum jumps. 

VI. DENSITY MATRIX 

The class of theories presented here are more general than in any based 
on an assumed differential equation for the density matrix, as there is much 
more information contained in the probability distribution P(ip) than in 
the density matrix. (For instance, for a system with two discrete states, 
the density matrix is specified by only three real parameters, while the 
probability distribution is an unknown real function of one modulus and 
two phases.) The density matrix in is defined in a general discrete basis by 

Pij {t) = J di> P(4>,t)4>iip* . (31) 

In particular, Eq. (3) gives the rate of change of the density matrix 

j t Pij(t) = jdip J dip K^P(^,t)^ . (32) 

In order for the right-hand side to be expressible in terms of p, we would 
need the space of bilinear functions of ip to be invariant under the left action 
of the kernel K: 

I dip K^^ ipiip* = ^i'tp'j* ■ (33) 

i'f 

This condition is preserved if we make a change of basis by a unitary trans- 
formation of the wave function, of course with a transformed k matrix. Not 
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all conceivable kernels satisfy a condition like Eq. (33) . Where this condition 
holds, the density matrix obeys the differential equation 



d_ 
~dt 



Pi]® = 



Pi'j'if) . 



(34) 



There are reasons to suppose that this must be the case. It is a familiar 
feature of quantum mechanics that different statistical ensembles of indi- 
vidual states can yield the same density matrix. Gisin[8] has shown that 
for any two such ensembles of states of a given physical system that have 
the same density matrix p, it is always possible to invent a second isolated 
physical system that can be entangled with the first, in such a way that 
measurements in the second system can drive the first system to one or the 
other of the two ensembles with density matrix p. This does not lead to any 
possibility of communication between the two systems, provided the den- 
sity matrix contains all information concerning any possible observation of 
the first system, and provided that the subsequent evolution of the density 
matrix depends only on the density matrix, not on the particular statistical 
ensemble it represents. But if Eq. (33) were not satisfied, then the evolution 
of the density matrix would depend on the specific statistical ensemble of 
state vectors, not just on the density matrix, and instantaneous communi- 
cation between isolated systems would be possible. 

Where the probability distribution approaches the limit (17) at late time, 
in the basis ip n described in Section III, the density matrix becomes diagonal 



where the P n are constants given by Eq. (19). Of course, by a unitary 
transformation the density matrix can be put in a diagonal form at any 
time, but with diagonal elements and in a basis that change with time. 
Eq. (35) tells us that the density matrix approaches a diagonal form with 
in a fixed basis and with fixed diagonal elements, equal to the expectation 
values (19) of the density matrix at any time. 



Finally, it is interesting to examine how the theory proposed in the well- 
known paper of Ghirardi, Rimini, and Weber [5] (henceforth GRW) appears 
in the more general formalism presented here. GRW suggested a stochastic 
evolution of the state vector, leading to its localization, and expressed their 




(35) 



VII. THE GRW CASE 
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model in a differential equation for the density matrix for a single particle 
(written here using the Heisenberg picture described above, and in a notation 
slightly different from that of GRW) 

J t P*'At) = (l - e- a «-*?' 2 ) p x ,, x {t) , (36) 

with to > and a > 0. (Here x is the eigenvalue of the one-dimensional 
Heisenberg-picture position operator x(t) = x(0) +pt/m.) Thus the condi- 
tion (33) here reads 

J # Km iP(x')iP* (x) = -u(l- e -«(-'-) 2 /2) ^(a/y ( x ) , (37) 

so the kernel has eigenvalues 

-\ xx , = - U (l- e -«(*'-*) 2 /^ < o , (38) 
with left-eigenfunctionals 

g xx ,W\ =i>{x'W{x) . (39) 

But this does not determine the kernel, since without changing Eq. (37) we 
can change K by adding any kernel for which 

J d^K^,^(x)r(x') = 0. (40) 

The zero modes (among others that would depend on K^) are the g xx '[ip] 
given by Eq. (39), with x = x' . This is a case where there is no gap between 
the negative eigenvalues and zero, and the probability distribution does not 
approach any definite limit, though the density matrix becomes increasingly 
diagonal as t — > oo. 

Bell [5] subsequently gave formulas for a jump operator J and for the 
probability distribution for the random parameter in J that would yield the 
GRW equation (36) for the evolution of the density matrix. (Bell's formulas 
do not follow uniquely from the GRW equation (36) for the evolution of the 
density matrix, but they do follow from other assumptions in the GRW pa- 
per.) In the one-particle one-dimensional case Bell's results (in a somewhat 
different notation) gives 

[J&](x)=j{x-M{x)/Ry>,0, (41) 
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where j(x) = (2a/ir) 1 ^ 4: exp(— ax 2 ); £ is a random parameter with prob- 
ability density i? 2 (^,£), and R(ip,£,) is determined by the normalization 
condition on Jip: 

R\M = Jd 3 x\j(x-mx)\ 2 ■ (42) 

Using Eq. (27), and setting the jump frequency F equal to co, we can use 
Eq. (41) to find the kernel K. Because of the ^-dependence of R(ip,£), 
it is not so easy here to find general solutions of the eigenvalue condition 
(29). But it is easy to see that Eq. (29) is satisfied for the functionals 
9x,x'bP] = ip(x')ip*(x) for arbitrary x and x'. In these cases the factors 1/R 
in [J^ip](x') and [J£ij)]*(x) are cancelled by the probability distribution R 2 
for £, and we find A = exp(— a(x — x') 2 /2). Using Eq. (30) then shows that 
ip(x')ip*(x) are left-eigenfunctionals of K with eigenvalues (38), so Eq. (37) 
is satisfied, and this yields the GRW equation (36). 
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